Published online 05 April 2014 



Nucleic Acids Research, 2014, Vol 42, No. 10 6673-6685 

doi: 10.1093lnarlgku249 



NAR Breakthrough Article 

Crystal structure of a 4-thiouridine synthetase-RNA 
complex reveals specificity of tRNA U8 modification 

Piotr Neumann^ Kristina Lakomek^ t, Peter-Thomas Naumann\ Whitney M. Erwin^, 
Charles T. Lauhon^ and Ralf Ficner^ * 

^Department of Molecular Structural Biology, Institute of Microbiology and Genetics, GZMB, University of Gottingen, 
37077 Gottingen, Germany and ^School of Pharmacy, University of Wisconsin, Madison, Wl 53705, USA 

Received January 9, 2014; Revised IVIarch 1 1 , 2014; Accepted IVIarch 13, 2014 



ABSTRACT 

In prokaryotes and archaea transfer ribonucleic acid 
(tRNA) stability as well as cellular UV protection re- 
lies on the post-transcriptional modification of uracil 
at position 8 (U8) of tRNAs by the 4-thiouridine syn- 
thetase Thil. Here, we report three crystal structures 
of Thil from Thermotoga maritima in complex with 
a truncated tRNA. The RNA is mainly bound by the 
N-terminal ferredoxin-like domain (NFLD) and the 
THUMP domain of one subunit within the Thil homo- 
dimer thereby positioning the US close to the cat- 
alytic center in the pyrophosphatase domain of the 
other subunit. The recognition of the 3'-CCA end by 
the THUMP domain yields a molecular ruler defin- 
ing the specificity for U8 thiolation. This first struc- 
ture of a THUMP/NFLD-RNA complex might serve 
as paradigm for the RNA recognition by THUMP do- 
mains of other proteins. The ternary Thil-RNA-ATP 
complex shows no significant structural changes 
due to adenosine triphosphate (ATP) binding, but two 
different states of active site loops are observed in- 
dependent of the nucleotide loading state. Thereby 
conformational changes of the active site are cou- 
pled with conformational changes of the bound RNA. 
The Thil-RNA complex structures indicate that full- 
length tRNA has to adopt a non-canonical conforma- 
tion upon binding to Thil. 



INTRODUCTION 

In all domains of life ribonucleic acid (RNA) maturation in- 
cludes the post-transcriptional modification of nucleosides 
(1-3). The widest variety and largest number of modified 
nucleosides have been identified in transfer RNAs (tRNAs). 
Among them are 16 different thio-nucleotides including the 
4-thiouridine (s'^U), which is ubiquitously found and always 
located at position 8 in the loop region connecting the ac- 
ceptor stem and the D stem of eubacterial and archaeal tR- 
NAs. s'^US does not only stabilize the fold of the tRNA 
(4,5), but additionally plays a central role in bacterial UV 
protection as it acts as a sensor for near-UV radiation (6- 

The biosynthesis of s^U8 is catalyzed by 4-thiouridine 
synthetase (Thil) (10), which consists at least of three do- 
mains: an N-terminal ferredoxin-like domain (NFLD), a 
THUMP domain and a pyrophosphatase (PPase) domain. 
In 7 -proteobacteria, such as Escherichia coli, and in species 
of the archaeon Thermoplasma, Thil contains a fourth, 
C-terminal rhodanese-like domain (RLD) (11). The crys- 
tal structures of Thil from Bacillus anthracis (Thil^^) and 
of a putative Thil ortholog from the archaeon Pyrococcus 
horikoshii have been solved (12,13). Based on the Thil^^ 
structure and the results of biochemical studies, a func- 
tional assignment of the three common domains (NFLD, 
THUMP, PPase) has been proposed (12,14). While the 
NFLD and the THUMP domain form the major bipar- 
tite RNA binding module recognizing the tRNA acceptor 
stem region, the PPase domain binds adenosine triphos- 
phate (ATP) by the PP-loop motif SGGxDS and contains 
the catalytic center (12,15). 
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The catalytic mechanism of 4-thiouridine synthesis is not 
yet understood in detail, though a multi-step reaction mech- 
anism was proposed for Thil from E. coli (Thil^c) requiring 
the cysteine desulfurase IscS (16-19). The sulfur transfer 
from cysteine to Thil involves two consecutive formations 
of an internal persulfide at a cysteine residue of firstly IscS 
and secondly of the RLD of Thil (18). After activation of 
U8 by adenylation, catalyzed by Thil's PPase domain, two 
reaction pathways were considered: either the cysteine per- 
sulfide of the RLD performs a direct nucleophilic attack on 
the adenylated US or the terminal sulfur of the persulfide is 
liberated as hydrogen sulfide, which serves as a nucleophile 
to displace the activated oxygen of the uracil base (1 1,19). In 
both scenarios, finally adenosine monophosphate (AMP) 
and s^U8-tRNA are released, and a disulfide bond between 
the cysteine of the RLD and a cysteine of the PPase do- 
main are formed (20). For the regeneration of the active en- 
zyme this disulfide bond has to be reduced, hence, in vitro 
Thil catalyzes only a single turnover in the absence of a re- 
ducing agent (20,21). In contrast, Thil orthologs lacking 
the RLD seem to apply a different mechanism as recently 
demonstrated for Thil orthologs from Bacillus subtilis (22) 
and Methanococcus maripaludis (23) involving the sulfur- 
transferase NifZ or IscS, respectively. Here, the persulfide is 
generated on a cysteine of the PPase domain, and finally a 
disulfide is formed with another cysteine of the PPase ac- 
tive site. This mechanism is reminiscent of that of the 2- 
thiouridine synthetase MnmA, which modifies U34 in the 
anticodon of some tRNAs (24). 

Since the target U8 is buried in the core of canonical L- 
form tRNA, conformational changes have to occur in or- 
der to make it accessible for Thil (14), but so far it has 
been unknown how US is specifically recognized. A trun- 
cated tRNA consisting of 39 nucleotides (TPHE39A) was 
previously shown to serve as minimal substrate for s^U8 
synthesis (25). It corresponds to the acceptor and T-stems 
of substrate tRNA^^^ from E. coli as nucleotides Gl -A7 and 
U29-A39 of TPHE39A match G1-A7 and U66-C72 of the 
acceptor-stem of tRNAP^% and C15-G28 of TPHE39A are 
equivalent to C49-G65 in the T-stem of tRNA^^^ (Supple- 
mentary Figure SI). The bulge (US -CI 4) connecting both 
stems of TPHE39A is composed of the target U8, the adja- 
cent A9 and the variable loop sequence G10-C14, which is 
equivalent to G44-C48 of the full-length tRNA^^^ Surpris- 
ingly, even though the 3D structure of TPHE39A differs in 
the T-stem and T-loop in comparison to full-length tRNA 
(14), it can be specifically modified by Thil (25). 

In order to gain insight into the RNA binding mode 
of Thil and to understand the structural basis of Thil's 
specificity for both tRNA and the target US, we deter- 
mined the crystal structure of Thil from Thermo toga mar- 
itima (Thil^m) in complex with the truncated tRNA^^^ 
(TPHE39A) from E. coli. With regard to the crystal struc- 
ture of Thil^^, the structure of each domain of Thil7>„ re- 
mains unchanged upon RNA binding, but their relative ori- 
entation to each other changes considerably. The tRNA ac- 
ceptor stem and the crucial single-stranded ACCA 3 '-end 
form multiple interactions with the NFLD and THUMP 
domain of one monomer, respectively. This binding mode 
enables correct positioning of U8 close to the active site in 
the PPase domain of the other subunit within the homo- 



dimeric Thil. The complex structure also suggests that the 
conformation of full-length substrate tRNA has to largely 
deviate from the canonical L-shape for productive binding. 

MATERIALS AND METHODS 

Crystallization 

The overexpression and purification of Thil7>„, and the 
preparation and crystallization of the Thil7)^-RNA com- 
plex, has previously been described (26). Briefly, 1 |jl1 of 
complex (Thil-RNA complex 10 mg/ml, 20 mM Tris/HCl, 
pH 7.5, 150 mM (NH4)2S04) was mixed with 3 |jl1 of reser- 
voir (2 M sodium formate, 100 mM sodium citrate, pH 4.6, 
2 mM DTT). Single crystals were obtained at 20° C after I 
week in sitting drops by vapor diffusion both in the presence 
of ATP (2 mM) and MgCl2 (2 mM) or in the absence of ATP 
and MgCl2. For data collection, crystals were transferred 
into cryo buffer (2 M sodium formate, 100 mM sodium cit- 
rate pH 4.6, 30% (v/v) glycerol). 

X-ray diffraction data collection 

The X-ray diffraction data of two crystals (Thil7>„-RNA 
and Thil7>„-RNA-ATP) have been collected at the beam- 
line XI 1 of the EMBL outstation Hamburg at DESY, Ger- 
many, at 100 K and a wavelength of 0.9780 A and 0.8140 
A, respectively. The diffraction images of the Thil7>„- 
RNA-ATP complex have been processed with the XDS 
package (27,28) and of the Thil^m-RNA complex with 
MOSFLM/SCALA (29,30). X-ray diffraction data statis- 
tics are summarized in Table 1 . 

Controlled crystal dehydration 

In order to improve the resolution of the X-ray diffraction 
data a controlled crystal dehydration experiment was per- 
formed. The crystal was mounted in a LithoLoop (Molecu- 
lar Dimensions Ltd, Soham, UK) and placed into the head 
of the Free Mounting System (FMS; Proteros Biostructures 
GmbH, Martinsried, Germany) in an adjustable and repro- 
ducible stream of humidified gas, which allows a controlled 
change of relative humidity (31). At the beginning the rela- 
tive humidity was set to 91.8% of that of the mother liquor 
(99.9%). After applying the humidity gradient, the diffrac- 
tion quality of the crystal was monitored by collecting a se- 
ries of diffraction images. The greatest improvement of the 
diffraction properties of the crystal was achieved at a rela- 
tive humidity of 83% causing a reduction of the unit cell vol- 
ume by 15%. When the relative humidity was reduced below 
this value (the lowest 79.8%) the diffraction deteriorated, 
however returning to the optimal relative humidity of 83% 
restored the diffraction properties of the crystal. The crystal 
was flashed cooled after coating it in a thin film of perflu- 
oropolyether oil and used for the further diffraction data 
collection using synchrotron radiation where it diffracted 
X-rays to the maximum resolution limit of 2.85 A. Diffrac- 
tion data were collected at the beamline BL14.1 at BESSY, 
Berlin, Germany (32), at 100 K and a wavelength of 1.0085 
A on a marccd 165 detector in rotation steps of 0.5°. The 
oscillation images have been processed using the XDS pack- 
age (27,28). 
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Table 1 Data collection and refinement statistics 



Dehydrated crystal Non-dehydrated crystal Non-dehydrated crystal 

Thilrm-RNA(FMS) Thilrm-RNA Thilr,^-RNA-ATP 



Data collection 



Beamline 


BL14.1, BESSY 


XI 1, DESY/EMBL 


XI 1, DESY/EMBL 




Berlin, Germany 


Hamburg, Germany 


Hamburg, Germany 


Detector 


MARCCD 165mm 


MARCCD 165mm 


MARCCD 165mm 


Wavelength (A) 


1.008500 


0.978000 


0.814000 


Space group 


P2i2i2i 


P2i2i2i 


P2i2i2i 


Cell dimensions 








a, b, c (A) 


100.65, 110.33, 119.31 


103.28, 113.55, 132.15 


103.17, 113.22, 133.1 


Resolution (A) 


37.18-2.85 


57.07-3.50 


44.45_3.42 




(2.95-2.85)^ 


(3.69-3.50)^ 


(3.52-3.42)^ 


Emerge ^ (%) 


8.5 (62.6) 


8.9 (61.1) 


9.8 (67.0) 


IMI) 


16.42 (2.06) 


9.9 (2.1) 


14.8 (2.5) 


Wilson B (A^) 


57.5 


102.5 


76.5 


Completeness (%) 


96.4 (81.4) 


99.3 (99.9) 


97.9 (84.7) 


Refl. total/unique 


198702/30547 


85440/20061 


76901/21271 


Redundancy 


6.2 (2.7) 


4.3(4.1) 


3.6 (3.3) 


Software used for data processing 


XDS/XSCALE 


MOSFLM/SCALA 


XDS/XSCALE 



Refinement 

Resolution (A) 



No. reflections 

^work (%) 
%ee' (%) 

No. atoms 

Protein 

RNA 

Ligand/ion 
Water 

5-factors (A^) 

Protein 

RNA 

Ligand/ion 
Water 

Solvent content (%) 
R.m.s. deviations 
Bond lengths (A) 
Bond angles (°) 
Ramachandran plot, 
favored/disallowed (%) 
Coordinate error (A)^ 
PDB code 



37.18-2.85 
(2.95-2.85) 

30363 

18.61 (27.36) 

22.83 (31.53) 

7976 

6231 

1666 

4 

75 

83.3 

66.2 

148.5 

140.7 

55.2 

57.8 

0.012 
0.927 
95.7/0.5 

0.26 
4KR6 



29.75-3.50 
(3.68-3.50) 

19940 

23.02 (30.12) 
26.86 (33.81) 
7860 
6198 
1662 



29.39-3.42 
(3.63-3.42) 

21154 

23.04 (38.58) 

27.38 (41.23) 

7924 

6198 

1662 

64 



159.2 
148.6 
198.6 



103.1 
91.3 
147.3 
94.8 



64.0 



64.1 



0.009 
0.979 
90.2/3.3 



0.010 
0.968 
94.3/1.17 



0.48 
4KR9 



0.60 
4KR7 



^Values in parentheses are for the highest-resolution shell. 

^i^merge = ^hkl lli(hkl) - < I(hkl)>l/I]hkl li(hkl). 

^Rfree factor calculated for 5% randomly chosen reflections not included in the refinement. 
Maximum-likelihood based coordinate error estimated by the refinement program (PHENIX). 



Structure determination of the Thilj^-RNA complex using a 
dehydrated crystal 

The Thil7>^-RNA complex crystallized in the space group 
P2i2i2i (a = 100.65 A, b = 110.33 A, c = 119.31 A, a = 
P = ^ = 90°) with two protein and two minimal substrate 
RNA molecules in the asymmetric unit (solvent content of 
68.7%). The data set collected from the crystal treated by 
controlled dehydration using the FMS was used in order to 
solve the phase problem by Molecular Replacement (MR) 
method and build the model. The known crystal struc- 



ture of Thil^^ (12) sharing 40% identity with Thilj)^ was 
used and the search was performed using PHASER (33). 
First the largest domain (residues 177-388) of the RNA- 
free structure of Thil^^ with all non-conserved residues re- 
placed by serine has been located. The two missing do- 
mains comprising the residues 1-77 and 78-163, respec- 
tively, were placed based on superposition of the Thil^^ 
monomer and optimized by rigid body refinement using the 
program package CNS (34,35). The model was rebuilt in 
COOT (36) and refined with strong non-crystallographic 
symmetry (NCS) restraints using CNS. The NCS restraints 
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were later relaxed for regions showing conformational dif- 
ferences between the two monomers. When the i^-factors 
dropped below 27.5% (i^work) and 30.3% {Rhto), the den- 
sity for oligonucleotides became visible and allowed man- 
ual building of the two TPHE39A molecules. The model 
was validated against SIGMAA-weighted simulated an- 
nealing (SA) (difference) omit maps as calculated with CNS. 
The last refinement steps were performed using PHENIX 
(37,38) with eight translation, libration and screw motion 
(TLS) groups allowing modeling of anisotropic displace- 
ments of individual Thil domains and two RNA molecules. 
The final model consisting of residues Metl-Glu388 and 
nucleotides 1-39 for both protein monomers and RNA 
molecules, 78 water molecules and four partially occupied 
mercury ions (derived from soaking the crystal with HgCl2 
prior FMS treatment) was refined at the resolution of 2.85 
A to i^work and i^free factors of 18.5% and 22.7%, respec- 
tively. Stereochemical quality of the structure was assessed 
by MolProbity (39,40) (95.98% of the residues are located 
within the favored regions of the Ramachandran plot and 
0.52% in the disallowed regions). 

Structure determination of Thilj^^-RNA complexes at low 
resolution 

The Thil7)^-RNA-ATP complex structure has been solved 
by MR using PHASER and the high-resolution struc- 
ture of Thil7)^-RNAFMS as the starting model. Several 
MR searches have been performed in order to localize 
the protein part using separate Thil^m monomers. The re- 
sulting solutions of symmetrical and asymmetrical Thil 
dimers (monomer compositions AA, AB, BB, BA) have 
been scored according to the log-likelihood gain (LLG) val- 
ues as calculated by PHASER. The protein model reveal- 
ing the highest LLG score has been used as a partial so- 
lution for the search of tRNA molecules using truncated 
models of TPHE39A (3'-ACCA end and nucleotides form- 
ing the loop regions, Ul 1 to CI 5 and A21 to C24, have been 
deleted). The structure was manually rebuilt and verified 
against SIGMAA-weighted SA omit maps, composite omit 
maps and difference maps using COOT. The structure was 
refined in CNS using strong NCS restraints between equiv- 
alent residues in the two monomers and weaker ones be- 
tween acceptor and T stems of tRNA molecules. The NCS 
restraints were later relaxed for regions showing different 
conformations in each Thilj)^ monomer (N-termini, loop 
regions 11 to 22, 287 to 295, 333 to 350). For refinement, 
5% of the reflections were randomly chosen and left out for 
cross-validation as the free i^-factor. In order to reduce the 
risk of over-fitting, refinement was based on slow-coohng 
SA (torsion angle dynamics) combined with standard mini- 
mization and grouped 5-f actor refinement. Remaining non- 
interpreted electron density in the active site clearly exhib- 
ited the size and shape of a bound nucleotide triphosphate. 
After adding an ATP molecule to both catalytic centers, 
in each active site a difference peak of mFo-DFc electron 
density at a contour level of 3.5 sigma has been observed 
within 2.5-3.3 A distances to three oxygens of the triphos- 
phate moiety. Those peaks have been interpreted as Mg^"^ 
ions as they were present in the crystallization buffer. Re- 
finement of individual atomic displacement factors for the 



Mg^^ ions resulted in ^-factors in the same range as those 
of the ATP molecules (refined as grouped ^-factor). Non- 
ideal Mg^^-O distances could result from low resolution of 
the diffraction data and consequently larger positional er- 
ror, however we can not rule out the possibility that the ob- 
served difference electron density peaks could correspond 
to a highly coordinated water molecules or a mixture of the 
two mentioned individuals. 

The final protein model consists of residues 3-388 for 
each of the Thilj)^ monomers and two copies of TPHE39A 
(nucleotides 1-39). To model anisotropic displacements of 
the Thil domains and tRNA molecules, we performed the 
final refinement step using PHENIX with eight TLS groups 
at 3.42 A resolution resulting in i^work and i^free factors 
of 23.04% and 27.38%, respectively. The ATP-free Thil^m- 
RNA complex structure is isostructural with the Thil7>„- 
RNA-ATP complex structure and has been refined using 
the same strategy at the resolution of 3.5 A to the i^work and 
i^free factors of 23.02% and 26.86%, respectively. 

Structure analysis 

The contact areas between the two Thil molecules in the 
asymmetric unit were analyzed with the program PISA (41). 
The calculated complexation significance score amounted 
to 1.0, which is the maximal possible value, suggesting 
the existence of a stable homo-dimer. A ^-factor sharp- 
ened SA omit electron density map of ATP was calcu- 
lated with CNS (35). Figures were generated using PyMOL 
(The PyMOL Molecular Graphics System, Version 1.5.0.4 
Schrodinger, LLC). Analysis of RNA secondary structures 
and 2D secondary structure plots were done with the pro- 
gram RNAview (42,43). 

Complementation of coli Athil{iy¥3) 

E. coli Z\f/z//(DE3) cells were transformed with the pET15b 
plasmids encoding either wild-type Thil7>„, or Thil^m 
C165S or Thil C344S. Overnight cultures (3 ml) were 
grown in lysogeny broth (LB) medium in the presence of 
50 |xg/ml ampicillin at 37^C with shaking at 250 rpm and 
used to seed 50 ml cultures. When the cultures reached an 
OD600 = 0.6, expression was induced with isopropyl (B-D-l- 
thiogalactopyranoside (IPTG) (1 mM) and the temperature 
was raised to 45° C for the remainder of the overnight incu- 
bation period. Unfractionated tRNA was isolated, digested 
and analyzed using high pressure Hquid chromatography 
(HPLC) at a wavelength of 330 nm as previously described 
(17). Samples from the Thilj)^ complementation were com- 
pared to a sample from the wt E. coli strain MCI 061 grown 
overnight in LB at 37°C. 

Thilj^ DEAE filter disc activity assay 

Reactions (50 fxl) containing 50 mM Tris, pH 7.5, 5 mM 
Mg(OAc)2, 50 mM KCl, 4 mM ATP, 1 mM DTT, 10 |jlM 
tRNA substrate, 0.5 [xg IscS^m and 0.5 [xg of wt or mutant 
Thil7>„ were mixed and initiated by the addition of 20 fxM 
L-cysteine. The mixtures were incubated at 80^C for 2 min. 
Aliquots (25 |xl) were then removed and apphed to DE81 
filter discs (Whatman) and processed for scintillation count- 
ing as previously described (25). 
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HPLC assay for AMP formation during Thilj^^ catalyzed 
s'^U formation 

Complete reaction mixtures (50 fxl) contained 50 mM Tris, 
pH 7.5, 5 mM MgCls, 50 mM KCl, 0.2 mM ATP, 1 mM 
DTT, 1 |xg IscSrm, 1 mg Thiirm and 20 |jlM TPHE39A (or 
tRNA^^^) RNA. After incubation for 10 min at 37°C, mix- 
tures were analyzed by HPLC using a Supelco LC-18S col- 
umn eluted with a 40 min gradient of 0-20% methanol in 
20 mM potassium phosphate, pH 6.0, with UV detection at 
260 nm. 

RESULTS 

Crystal structure determination 

Thil from the hyperthermophilic bacterium T. maritima 
(Thil7>„) was crystallized with bound truncated tRNA^^^ 
(TPHE39A) from E. coli (26). Thilj)^ is capable to com- 
plement an E. coli Athil strain by modifying U8 of E. 
coli tRNAs (Supplementary Figure S2), and the truncated 
tRNA^^^ TPHE39A from E. coli is in vitro an excellent sub- 
strate for Thil7>„, showing similar levels of s'^U modification 
as a full-length tRNA transcript (Supplementary Figure 
S3). The crystals usually exhibited poor diffraction prop- 
erties, but optimization of the crystallization and crystal 
cryo -protection conditions followed by an extensive crystal 
screening yielded a few crystals diffracting to a resolution 
limit of about 3.5 A. The diffraction quality of one Thil7>„- 
TPHE39A crystal was further improved by controlled de- 
hydration providing data to 2.85 A resolution (Thil7>„- 
RNApMs), concomitantly leading to significant unit cell 
shrinkage (Table 1). The Thil7)^-RNAFMS complex struc- 
ture could be solved by MR only when using separated do- 
mains of the RNA-free structure of Thil^^ (12) as search 
models. After rebuilding the structures of the two Thil 
molecules present in the asymmetric unit, two TPHE39A 
molecules were manually built based on difference elec- 
tron density maps (Supplementary Figure S4), and the crys- 
tal structure of the Thilrm-RNApMS complex was refined 
at a resolution of 2.85 A. Additionally, two Thil7>^-RNA 
complex structures, both from non-dehydrated crystals, 
have been solved by MR using fragments of the Thil7>„- 
RNApMS structure as search models. One of these com- 
plexes is ATP-free (Thil7)^-RNA), while the other one rep- 
resents an ATP-bound form (Thil^m-RNA-ATP). These 
structures were refined at a resolution of 3.50 A and 3.42 A, 
respectively, with reasonable stereochemistry and i^- values 
(Table 1). All structures exhibit elevated values of the atomic 
displacement factors (^-factors) reflecting a high degree of 
crystal disorder most likely caused by the high flexibihty of 
the Thil-RNA complexes (see below), which correlates with 
the limited resolution and the Wilson ^-factors of the cor- 
responding diffraction data. 

Overall structure of the Thil j^-RNA complex 

The Thil7>„ monomer has an elongated overall shape 
and comprises three structural domains: an N-terminal 
ferredoxin-like domain (NFLD: Metl-Lys72), followed by 
a THUMP domain (Gly73-Aspl61) that is connected via 



a short flexible linker region (Argl62-Leul69) with the 
C-terminal PPase domain (Prol70-Glu388) (Figure lA). 
In the 3D structure, the NFLD is located between the 
THUMP and PPase domains, hence, the linker connecting 
the THUMP and PPase domains crosses it. The structures 
of the Thil domains and their overall arrangement are 
similar to that of Thil^^ (12), but the relative positions of 
the domains differ significantly (Supplementary Figure S5). 

The asymmetric unit of the crystals contains two Thilrm 
protein molecules, each binding one RNA molecule. The 
size and properties of the contact surface between the two 
Thil molecules in the asymmetric unit indicate the pres- 
ence of a physiologically relevant homo-dimer (Figure IB). 
The dimer interface buries an area of about 3720 A^, which 
corresponds to about 1 1% of each monomer's solvent acces- 
sible surface area (32,488 A^). The interactions at the inter- 
face include 23 hydrogen bonds and 10 salt bridges, which 
are mainly formed by residues of the PPase domain and 
partially extend to the NFLD (three interactions), but ex- 
clude the THUMP domain. One hydrogen bond is formed 
between the backbone amide group of Glyl68, which is part 
of the flexible linker, and the side chain of Asp319 located 
in the PPase domain of the other monomer. The dimer ob- 
served for the Thil7>^-RNA complex corresponds to the 
dimer of Thil^^ defined by crystallographic 2-fold symme- 
try axis. The superposition of Thil^^ and Thilj)^ dimers 
unveiled that only the PPase domains represent the rigid 
and well superimpo sable part (r.m.s.d. of 1.4 A for 301 Ca 
atoms). 

Each of the two RNA molecules present in the crystal's 
asymmetric unit interacts with both subunits of the Thil 
dimer (Figure IC and D). The RNA is primarily bound 
by the NFLD and THUMP domain, while the bulge re- 
gion harboring the U8 is located close to the active site 
of the PPase domain of the other subunit (Supplemen- 
tary Table SI). The fact that the Thil homo-dimer binds 
the RNA molecule by combining the catalytic PPase do- 
main of one subunit with the NFLD and THUMP domain 
of the other subunit clearly demonstrates that the func- 
tional entity of Thil is a homo-dimer, which can bind two 
RNA molecules simultaneously. Notably, the majority of 
the residues involved in RNA-binding are conserved among 
Thil orthologs. 

Flexibility of the Thilj^ homo-dimer 

The comparison of the Thil^^ and Thil^m-RNA struc- 
tures reveals an asymmetry of the Thil7>„ homo-dimer 
due to different relative orientations of the NFLD and 
THUMP domains with respect to the PPase domains (Fig- 
ure 2A). This flexibility of the protein correlates with dif- 
ferences in the conformation and orientation of the two 
TPHE39A molecules and non-identical RNA-protein in- 
teractions even within each homo-dimer. The homo-dimer's 
asymmetry was assessed by superimposing the PPase do- 
mains of Thil7>„ and Thil^^ and then measuring the rota- 
tional angle and the magnitude of the translation vector re- 
quired to obtain a perfect superposition of the individual 
NFLD and THUMP domains of the Thil^m-RNA com- 
plex structures onto the equivalent domains of the sym- 
metrical Thilsa dimer (Supplementary Table S2). Interest- 
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Figure 1. Structure of the Thil7)„-RNA complex. (A) Ribbon representation of the Thilj^,^ monomer consisting of a NFLD (yellow), a THUMP domain 
(red) and a pyrophosphatase (PPase) domain (blue). (B) Ribbon representation of the Thil homo-dimer. The dimerization interface is mainly formed by the 
PPase domains. (C) The Thil7>„ homo-dimer binds two RNA molecules. The THUMP domain and the NFLD bind the acceptor stem, thereby positioning 
the U8 (position indicated by an arrow) close to the active site of the PPase domain of the other monomer. (D) View perpendicular to (C), showing how 
the three domains belonging to both monomers wrap around the RNA. 



ingly, the observed asymmetry correlates well with the num- 
ber of crystal and protein-RNA contacts and does not 
cause significant structural changes within the individual 
domains as indicated by negligible r.m.s.d. values obtained 
from domain-wise comparison. The largest differences are 
observed for the Thil7)^-RNAFMS structure, most likely 
caused by the crystal dehydration and unit cell shrinkage. 
The Thil7>„-RNA and Thil7>^-RNA-ATP structures show 
a similar, moderate degree of asymmetry indicating that 
ATP binding does not cause large conformational changes 
in the protein. 

RNA conformation 

The conformation of the bound TPHE39A molecules is 
very similar to that of crystallized unbound TPHE39A 
(14) and basically resembles the corresponding regions of 
tRNA^^^ (44) (Figure 3, Supplementary Figure SI). The 
acceptor stem perfectly exhibits the architecture common 
to cytoplasmic tRNAs as the canonical Watson-Crick base 
pairs are maintained. The T-stem equivalent (C15-G28) 



is distorted compared to the canonical tRNA structure 
(C49-G65). However, none of the canonical trans Watson- 
Crick Hoogsteen (U8-A14, U54-A58), trans Watson-Crick 
'Levitt' (G15-C48), trans Watson-Crick sugar-edge (G18- 
U55) and cis Watson-Crick (G19-C56) base pairs found 
in unmodified tRNA^^^ (44) or mature tRNA (45) struc- 
tures can be formed due to the respective lack of one of 
these nucleotides. Interestingly, some of these missing inter- 
actions in TPHE39A are substituted by other interactions 
involving the same set of atoms. Unlike in tRNA^^^, U8 of 
TPHE39A forms a wobble base pair with G28 (Figure 3), 
and A9 of TPHE39A interacts with both GIO of the bulge 
and U17 within the T-stem leading to a compaction of the 
structure. The contact of U54 in tRNA^^^ is replaced by a 
trans Watson-Crick/ Watson-Crick interaction of the cor- 
responding G20 to A23. CI 5, the equivalent of C48, builds a 
trans Watson-Crick/Hoogsteen pair with G12 of the bulge 
in an antiparallel orientation. 

Comparison of the six independent TPHE39A molecules 
of the three dimeric Thil7>„-RNA complex structures iden- 
tified significant conformational differences in the bulge re- 
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3'-ACCA 

Figure 2. Flexibility of the Thil^^-RNA complex structure. (A) Structures of three Thil dimers are compared by the best-fit superposition of their PPase 
domains highlighting the deviation from the symmetric Thil^^^^ structure and the different relative orientations of the THUMP domains. Perfectly symmetric 
Thil^^ is shown in green, the structure of Thilj^^ of the Thil7)„-RNA complex is colored yellow and the structure of Thil 7^,^ obtained from the dehydrated 
crystals of the Thil7>^-RNA complex is colored red. Due to the lack of significant conformational changes of the PPase domains and almost perfect 
superposition the ribbons are not clearly distinguishable for the individual PPase domains. (B) Superposition of the six RNA molecules present in the 
asymmetric units of the three crystals of Thil7>,^-RNA complexes indicating different flexibilities for the 3'-ACCA end (blue rectangular), T-loop (green 
rectangular) and the bulge region (orange rectangular). The acceptor stem (purple rectangular) and T-stem (not marked) are structurally very similar. (C, 
D) Two perpendicular views showing the recognition of the 3'-ACCA end by the THUMP domain. The interactions of the 3'-ACCA end with the THUMP 
domain are almost identical for all six Thil7>^-RNA complexes and apparently independent of their different overall structures. 



gion (A9-C15) and the T-loop (G20-A23), while the accep- 
tor stem and T-stem structures are very similar (Figure 2B, 
Supplementary Table S3). These deviations correlate with 
the number of crystal contacts and elevated atomic B fac- 
tors. In contrast, the 3'-ACCA moiety is not involved in 
any crystal contacts and reveals the lowest atomic B fac- 
tors within the RNA. It is tightly bound by the THUMP 
domain with almost identical conformation and pattern of 
RNA-protein interactions in all ThilTm-R^^ complexes 
(Figure 2C and D). Strikingly, the position of the 3'-ACCA 
with respect to the acceptor stem deviates between RNA 
molecules bound by different Thilj)^ monomers. These dif- 
ferences correlate with a rotation around the bonds con- 
necting the 3'-ACCA end with the acceptor stem, namely 
the phosphate group of A36 and the ribose of C35. These 
observations suggest that variations in relative orientations 
of the RNA-binding domains can be compensated by the in- 
trinsic flexibility of the RNA molecule in order to preserve 
proper placement of the acceptor stem and the bulge region 
harboring U8. 



Structural basis for the specificity of U8 thiolation 

Thil modifies U8 of all bacterial tRNAs, hence there is no 
RNA sequence motif determining the specificity for U8 thi- 
olation. As previously demonstrated in biochemical studies, 
the single-stranded 3'-ACCA end of tRNA^^^ is essential 
for the C4-thiolation of US (25). The Thil^m-RNA crystal 



structures demonstrate that indeed the highly specific bind- 
ing of the 3'-ACCA end by the THUMP domain generates 
a molecular ruler defining the length from the 3'-ACCA end 
to the site of modification. The 3'-ACCA end is bound in a 
groove formed by the (3 -strand Thrl02-Lysl09, the a-helix 
Vail 18-Asnl32 and the loop Phel33-Aspl44, here referred 
to as CCA-binding loop. 

The pattern of interactions between the 3'-ACCA moi- 
ety and the THUMP domain is very similar in all Thilj)^- 
RNA structures (Figure 2C and D). The 3'-terminal A39 
forms both polar contacts and hydrogen bonds with Serl23 
and Vall05, as well as with other nucleotides of the 3'- 
CCA end (Figure 4). In addition, several van-der-Waals 
contacts are made to residues of helix a4 (Asnl22, Serl23, 
Glyl26, Alal27), (B-strand (36 (Lysl04, Vall05) and the 
CCA-binding loop (Vail 3 8). The next-to-last nucleotide, 
C38, forms one hydrogen bond to Tyrll9, a polar interac- 
tion to Argl41 of the CCA-loop, and van-der-Waals con- 
tacts with the side chains of Vail 40 and Argl41. The con- 
formation of C38 is further stabilized by interactions within 
the 3'-ACCA end. Like A39 and C38, C37 forms polar 
interactions with Lysl04 and Vail 40, and within the 3'- 
ACCA. Since A3 6 functions as a flexible linker between the 
acceptor stem and the 3'-ACCA end, its interactions dif- 
fer among the various Thil-RNA complexes. As expected, 
the base of A3 6 forms stacking interactions with its adja- 
cent nucleotides A37 and C35. In most of the complexes 
A36 is not bound by the protein, as it is placed more to- 
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Figure 3. RNA structure. (A) Superposition of the crystal structure of unmodified tRNA^^^ from E. coli (PDB: 3L0U (44)) and the truncated tRNA^^^^ 
(TPHE39A RNA) structure as observed in the complex structure with Thilj)^ . (B, C) Structure-based secondary structure representation of (C) TPHE39A 
RNA (chain X of PDB 4KR7) and (B) equivalent portion of tRNA^^^ (PDB: 3L0U) with all non-canonical interactions depicted as Leontis-Westhof 
symbols (43). The isolated open circles (Q) denote GU wobble pairs. The pair of an open circle and an open square connected by a solid line represents 
trans Watson-Crick/Hoogsteen interactions between antiparallel strands, while trans Watson-Crick/ Watson-Crick pairs between parallel strands are 
denoted by an open circle embedded within a solid line. The filled circle with an arrow (•^) symbolizes cis Watson-Crick/ Sugar interactions between 
antiparallel strands. Dashed lines represent single hydrogen bonds that are neither formed by the Watson-Crick face nor edge-to-edge. The symbols '-' 
and encode AU and GC cis Watson-Crick base pairs, respectively. 



wards the solvent exposed part of the RNA, however, in one 
monomer of the Thil7>„-RNA-ATP complex A3 6 forms 
hydrogen bonds with Lysl04 and Glnl06. The THUMP 
domain does not only bind the 3'-ACCA end but also the 
double-stranded acceptor stem by multiple contacts to nu- 
cleotides from G28 to A39 (for details see Supplementary 
Table SI). In addition to the THUMP domain the NFLD, 
which closely resembles the RNA recognition motif of ribo- 
somal protein S6, also binds the RNA mainly by residues lo- 
cated in the loops Argl0-Lysl9, and Trp44-Arg46, but there 
are also important contacts with residues of a-helix al (e.g. 
Arg21) and of p-strand (32 (e.g. Arg42). 

Catalytic center 

The synthesis of 4-thiouridine requires the activation of 
U8 by adenylation, which leads to a covalent bond be- 
tween 04 of U8 and the a-phosphate of ATP and to the 
release of pyrophosphate. We obtained crystal structures of 
Thil7)^-RNA complexes in the ATP-free as well as ATP- 
bound form. Comparison of both structures depicts that 
ATP binding does not cause large conformational changes. 
The adenosine moiety of the ATP is bound in the active 
site of the PPase domain similarly to the Thil^^-AMP com- 
plex structure (12). Most residues involved in binding of 
the adenosine moiety are conserved between Thil7>„ and 
Thil^^ (except Thr205 which is His208 in Thil^^) and have 
identical side chain conformations . The adenine ring is 
bound deeply in the active site and forms both polar and 
van-der-Waals interactions with Thilj)^ (Leul80, Leul81, 
Met267, Phe206, Serl82, Thr205, Val204) (Figure 5). The 
ribose makes several polar contacts with residues of the PP- 
loop (Leul80, Serl87) and of a loop at the dimer inter- 
face, here denoted as DI-loop (Gly286, Glu287). The phos- 
phate moieties form a network of strong hydrogen bonds 
and polar interactions with the PP-loop (Serl82, Glyl84, 
Ilel85, Aspl86, Serl87), residues of the DI-loop (Asn288, 



Gln291, Gln295) and Arg264. These interactions force the 
tri-phosphate of the bound ATP into a strongly kinked con- 
formation, which should facilitate the nucleophilic attack 
of the flipped U8 on the a-phosphate. Remaining differ- 
ence electron density at the ATP molecule was interpreted 
as Mg^^ ion located between the a- and 7 -phosphate groups 
(Figure 5). 

The cysteine residue 344 within the PPase domain of 
Thil^c was shown to be essential for catalysis (19). Mu- 
tation of the corresponding Cys344 to serine in Thilj)^ 
leads to a complete loss of activity (Supplementary Fig- 
ures S2 and S3). Notably, the active site loop carrying this 
catalytic cysteine is fully disordered in the Thil^^-AMP 
complex structure, whereas it is traceable in the Thil7>„- 
RNA and Thil^m-ATP-RNA structures (Figure 6). While 
no conformational changes are induced upon ATP bind- 
ing, the binding of RNA seems to force the active site loops 
into defined positions. Remarkably, different conformations 
are observed for four loops flanking the active site, when 
comparing the two active sites within the Thil7>„-RNA- 
ATP and Thil7>„-RNA complexes, respectively (Figure 6). 
They concern the NFLD loop Serl2-Arg21 (bottom loop), 
and the loops Pro210-Ser213, Ile289-Leu297 and Lys338- 
Asn351 (upper loop) of the PPase domain. In the Thilj)^- 
ATP-RNA structure both active sites contain ATP, but the 
relative position of the RNA-bulge harboring the U8 with 
respect to the active site is quite different. Thus, the con- 
formation of these active site loops and the position of the 
RNA substrate are directly correlated. Thereby conforma- 
tional changes of the bound RNA are coupled with con- 
formational changes of the active site. These structural re- 
arrangements mainly concern the bottom (Serl2-Arg21) 
and upper (Lys338-Asn351) active site loops that comprise 
Lysl7 and Lysl9 or the catalytic Cys 344, respectively. 

In case of the RNA molecule bound to the NFLD-B 
and THUMP-B, the bulge region is positioned further away 
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Figure 4. RNA-binding by the THUMP domain. (A) Hydrogen bonds between nucleotides of the single-stranded 3'-ACCA end and residues of the 
THUMP domain are shown as black dashed line, and hydrogen bonds within the RNA molecule are shown as yellow dashed line. (B-D) Three different 
enlarged views of (A) showing the interactions of nucleotides A39, C38, C37 and A36 with the THUMP domain. 



from the PPase-A domain and makes only two interac- 
tions with the active site residue Lys350-A (Figure 6A). 
The NFLD-B loop (Serl2-Arg21) folds towards the loop 
Ile289-Leu297 of the PPase-A domain resulting in direct 
interactions. The side chain of Lysl7-B points towards the 
active site of the PPase-A domain, while that of Gln291-A 
contacts the ATP phosphates (Figure 6A). The upper ac- 
tive site loop Ser336-A-Phe347-A has an extended confor- 
mation with the catalytic Cys344-A exposed to the solvent 
resulting in a very wide and open active site of PPase-A. In 
contrast, all three active site loops of the PPase-B domain 
exhibit different conformations (Figure 6B). The RNA is 
positioned closer to the PPase-B domain, so that its bulge 
region makes 17 contacts with the active site residues. The 
vicinity of the bulge region causes conformational changes 
in the NFLD-A loop. As a result, Lysl7-A changes its po- 
sition and is placed close to the RNA molecule so that its 
side chain interacts with the A9 phosphate. The loop Ile289- 
B-Leu297-B changes its conformation so that Gln291-B is 



located at the outer part of the active site with its side chain 
pointing towards the elongated a-helix of the NFLD-A. 
The active site loop of PPase-B folds towards the active site 
and partially covers it. Thus Cys344-B becomes buried and 
gets closer to the ATP, but still leaving the a-phosphate of 
the ATP accessible for the attack by U8 (Supplementary 
Figure S6). 

DISCUSSION 

Thil exhibits a structural modularity that combines a con- 
served catalytic domain with additional domains achiev- 
ing specific substrate recognition, a strategy also applied by 
other RNA-modifying enzymes (46,47). A central role for 
the RNA recognition by Thil plays the THUMP domain, 
which was initially identified in silico as putative RNA- 
binding domain (48). However, the mode of RNA bind- 
ing by a THUMP domain remained elusive, as no structure 
of a THUMP-RNA complex has been known so far. The 
Thil7>„-RNA complex structures unravel how the THUMP 
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Figure 5. Active site of the Thi\Tm PPase domain. The 5-factor sharpened (—80 A^) SA omit mFo-DFc difference electron density map (colored blue) for 
the bound ATP molecule and the Mg^^ ion (MG) is contoured at 3 sigma. Residues of the PPase domain involved in ATP binding are shown as sticks, 
and hydrogen bonds or salt bridges are indicated as dashed lines. 




Figure 6. Two states of the active site loops within the dimeric Thil7>„-RNA-ATP and Thil7>„-RNA complexes. (A, B) Overlay of active sites with or 
without bound ATP are shown for the two Thil7>„ monomers present in the asymmetric unit. The active site loops are colored cyan for monomer A 
and green for monomer B. The inlets show the SA mFo-DFc difference omit maps contoured at 4 sigma (colored blue) of the active site loop containing 
the catalytic Cys344. Residues 340-348 were omitted for electron density map calculation. (C) The superposition of four Thilj)^ monomers shows large 
movement of the active site loop containing the catalytic cysteine (Cys344) leading from an open to a closed state. This loop rearrangement is accompanied 
by conformational changes of three other loops flanking the active site and correlates with the different positions and conformations of the bound RNA 
molecules. 



domain binds both the single-stranded 3'-ACCA end and 
the double-stranded acceptor stem. The strict dependence 
of Thil activity on the 3'-ACCA end obviously guarantees 
that in vivo only correctly processed tRNAs are modified, 
as e.g. a blunt end tRNA reduces s'^US synthesis to below 
0.1% (25). 

The structure of Thil's THUMP domain closely resem- 
bles the ones found in pseudouridine synthase Pus 10 (49), 
tRNA cytidine deaminase CDAT8 (50), and the RNA 



methyltransferases RlmKL (51), Trml4/TrmN (52) and 
RlmM (53). The residues of the Thil^m THUMP domain 
involved in RNA-binding are mainly conserved among all 
THUMP domains, while other solvent exposed areas of the 
Thilj)^ THUMP domain are rather variable (Supplemen- 
tary Figure S7). Interestingly, with exception of Pus 10 all 
of these enzymes contain also a NFLD that forms together 
with the THUMP domain a functional entity similar to that 
of Thil. This applies especially for the tRNA-modifying 
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enzyme CDAT8, which catalyzes a C-to-U editing of ar- 
chaeal tRNAs at position 8 (50). The tRNA is expected to 
be recognized by the dimeric CDAT8 in an identical man- 
ner by the structurally conserved combination of NFLD 
and THUMP, thereby positioning C8 in the active site of 
the deaminase domain belonging to the other subunit of the 
homo-dimer (50). In case of the monomeric Trml4/TrmN, 
which methylates the guanine at position 6 of tRNA^^^, 
a Thil-hke binding of the tRNA by the THUMP/NFLD 
places G6 in proximity to the AdoMet molecule bound to 
the methyltransferases domain (52). However, the THUMP 
domain is not always accompanied by a NFLD, as seen in 
the pseudouridine synthase Pus 10 (49), hence the redefini- 
tion of the THUMP domain by including the NFLD, as 
done by the SCOP database, appears to be misleading. Su- 
perposition of the Thil and Pus 10 THUMP domains re- 
veals that PuslO could bind the 3'-ACCA end like Thil, but 
that the 5' end of the tRNA would clash with the helical ex- 
tension of the PuslO THUMP domain. Hence, conforma- 
tional rearrangements of the RNA and/ or protein are ex- 
pected for the PuslO-tRNA complex, e.g. a rotation of the 
3'-ACCA end relative to the acceptor stem. 

The crystal structures of the Thil7)^-RNA complexes de- 
scribed here demonstrate that only dimeric Thil is able to 
position U8 at the catalytic center of the PPase domain, 
and that the 3'-CCA end common to all tRNAs serves as 
the 'reference point' to measure the distance to the site of 
modification. An acceptor stem shortened by a single base 
pair prevents s^U8 synthesis, which is consistent with the 
structural data and appears to be physiologically reason- 
able as all bacterial tRNAs contain exactly seven base pairs 
in the acceptor stem. Interestingly, homo-dimerization as a 
prerequisite for catalysis has also been observed for some 
other RNA-modifying enzymes such as the pseudouridine 
synthase TruA (54) or CDAT8 (50). 

RNA-complex structures of other tRNA-modifying en- 
zymes revealed that the substrate tRNA is either bound pri- 
marily as a rigid body or it is remodeled significantly thereby 
increasing the affinity and/ or enabling access to an other- 
wise buried target base. Based on these tRNA conforma- 
tions required for catalysis tRNA-modifying enzymes have 
been divided into two families (55). Interestingly, a model of 
Thil with full-length L-shape tRNA obtained by superim- 
posing the acceptor stems of tRNA^^^ onto the Thil-bound 
TPHE39A results in serious clashes between the PPase do- 
main and the tRNA D- and anticodon stems (Supplemen- 
tary Figure S8). Therefore, full-length substrate tRNAs are 
expected to undergo large conformational rearrangements 
upon binding to Thil. This conclusion is supported by cir- 
cular dichroism (CD) spectroscopic studies indicating con- 
formational changes to occur in tRNA^^^ when it binds to 
Thil (14). The non-standard X-form of tRNA observed in 
complex with an archaeal transglycosylase (56) can be ruled 
out to be suitable for s'^U8 synthesis, since it would also 
clash with Thil's catalytic domain. Strikingly, the electro- 
static surface potential of Thil j)^ reveals positively charged 
areas as putative binding sites for the D- and anticodon 
arms of full-length tRNA (Supplementary Figure S8B). 

Although the overall structure of the three domains of 
the tripartite Thilj)^ and Thil^^ orthologs is quite similar, 
significant rearrangements of the individual domains rel- 



ative to each other can occur. They seem to represent an 
essential inherent feature of Thil to accommodate produc- 
tive substrate binding and target U8 recognition. Structural 
changes of loops around the active site in the PPase do- 
main result in two main shapes of its rim. The ATP bind- 
ing pocket is either rather shallow — probably preventing U8 
to flip into it — or larger and wider making it accessible for 
ATP and U8 binding (Supplementary Figure S6). Interest- 
ingly, the information about the conformation of the bound 
RNA seems to be transferred to the ATP binding pocket. 
This signahng cascade employs a set of loops flanking the 
active site that originate on one side from the NFLD of one 
monomer or on the opposite side from the catalytic PPase 
domain. These loops undergo coinciding rearrangements 
and place the catalytically essential Cys344 of the active site 
loop close to the bound ATP when the RNA has a proper 
position and conformation for U8 flipping, thus facilitat- 
ing specific s'^U8 formation. However, the crystal structures 
do not show a flipped out U8 bound in the catalytic cen- 
ter or even the adenylated U8 in case of the ATP complex. 
Probably the flip of U8 and the adenylation reaction only 
occur in the presence of IscS bound to the Thil-RNA com- 
plex, which would be consistent with biochemical studies 
(unpublished). Such an IscS-induced mechanism appears 
reasonable as the adenylated U8 could be attacked by the 
sulfide immediately after the flip and the activating adeny- 
lation reaction, thereby preventing a wasteful hydrolysis 
of the U8-AMP intermediate. To understand the underly- 
ing mechanism to greater extent, structural information on 
the Thil-RNA-IscS complex and the U8-adenylated Thil- 
bound RNA coordinated at the active site is needed. 

The reaction mechanism of 4-thiouridine synthetase ap- 
pears to show striking variability depending on the organ- 
ism. In Thil^c the Cys456 of the RLD forms an essential 
persulfide which was suggested to either directly attack the 
adenylated U8 or to serve as a source for the generation of 
hydrogensulfide anion (HS~) which then performs the nu- 
cleophilic attack (19). In contrast to the RLD-containing 
Thil^c, which utilizes the sulfurtransferase IscS, the RLD- 
deficient Thil from Bacillus subitilis (Thil^^) strictly de- 
pends on the sulfurtransferase NifZ, which transfers the 
sulfhydryl group onto a not yet identified cysteine of Thil^^ 
(22). In vitro, Thil and NifZ are sufficient to catalyze s'^U8 
synthesis, hence no additional RLD-protein is required. Re- 
cent studies on the RLD-deficient Thil from M maripaludis 
showed as well that in this methanogenic archaeon s'^U8 
synthesis does not require the RLD and that a conserved 
CXXC motif within the PPase active site forms the per- 
sulfide and disulfide. This mechanism is reminiscent of 2- 
thiouridine synthetase (MnmA), which exclusively modifies 
U34 in the anticodon loop of some tRNAs. In the crystal 
structure of MnmA^^ a disulfide bond between two active 
site cysteines (Cysl02, Cysl99) was observed (24). 

However, for Thilj)^ the mechanism has to be unique as it 
contains only one cysteine in the active site (Cys344). There 
is only one additional cysteine residue (Cysl65) present in 
Thil^m located in the loop linking NFLD and THUMP do- 
main. However, Cysl65 plays no role, as the C165S mu- 
tant of Thil is as active as wild-type protein both in vivo 
and in vitro (Supplementary Figures S2 and S3), which is 
consistent with the structure as Cysl65 is much too dis- 
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tant from the active site (Supplementary Figures S6C and 
S6D). Since the reaction requires the formation of a disul- 
fide bond, it is tempting to speculate that it is formed be- 
tween Cys344 and the persulfide carrying Cys of the IscS 
sulfurtransferase. The formation of AMP during in vitro 
thiolation of TPHE39A by Thilj)^ indicates the adenyla- 
tion of the substrate (Supplementary Figure S9), however, 
the covalent bond between the a-phosphate and U8 was 
not formed prior to crystallization of the Thil-ATP-RNA 
complex as it was observed e.g. in one of the crystal struc- 
tures of the 2-thiouridine synthetase MnmA (24). Hence, 
the obtained crystal structure of the Thil-ATP-RNA com- 
plex represents an initial inactive state. The mechanism of 
Thil^m activation remains yet unclear, but an involvement 
of IscS in Thilj)^ activation appears to be a plausible option. 
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